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Abstract 

In this paper we propose different schemes for option hedging when asset returns are modeled by dynamics 
from a general class of GARCH models. Since the minimal martingale measure fails to produce a probability 
measure in this setting, we construct local risk minimization hedging strategies with respect to a risk neutral 
measure. Local risk minimization is investigated in the context of Gaussian driven models, and two other 
minimum variance hedges are proposed in order to extend Duan's [31] delta hedge. These two methods are 
constructed using local risk minimizing hedges for bivariate diffusion limits of GARCH models. Numerical 
experiments are carried out in order to compare the different hedging schemes; in particular, the sensitivity 
of the hedging strategies with respect to several pricing measures is tested for a special class of non-Gaussian 
GARCH models for European style options with different moneyness and maturities. 

Keywords: GARCH models, hedging scheme, local risk minimization, conditional Esscher transform, Extended 
Girsanov Principle, bivariate diffusion limit, minimum variance hedge. 

1 Introduction 

Many empirical studies have shown strong evidence against some of the underlying assumptions of the Black-Scholes 
model [12 , in particular the constant value that is assumed for the volatility. The incorporation of a stochastic 
volatility in modelling the dynamics of asset returns plays a major role in explaining some of the stylized properties 
of financial time series. Consequently, the pricing and hedging of options based on stochastic volatility (SV) models 
have been frequently studied in the mathematical finance literature in both discrete and continuous time. Discrete- 
time discussions have been dominated by the Generalized Autoregressive Conditionally Heteroskedastic (GARCH) 
models introduced by Engle [JT] and Bollerslev |13j . 

The majority of the GARCH option pricing literature relies on the locally risk neutralized valuation relationship 
(LRNVR) proposed by Duan [31 . This principle is constructed using an equilibrium argument and generalizes the 
risk- neutral valuation methodology first proposed by Rubinstein [fJ5] and later developed by Brennan |15j . The key 
assumption behind the construction of the pricing kernel is the conditionally bivariate Gaussian distribution of the 
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asset returns and changes in the aggregate consumption. Therefore, option pricing using this approach can only be 
performed within the restrictive family of GARCH models driven by normal innovations. Various ways have been 
proposed for pricing European style options under such models. The traditional approach is to evaluate the risk 
neutral expectations of discounted payoffs by combining standard Monte-Carlo simulations with different variance 
reduction techniques, (see for example, Duan [3T], Duan et al [371 Christoffersen and Jacobs [2T], Hsieh and 
Ritchken [5 3) among many others). Another direction has been proposed by Heston and Nandi [51] who obtain 
semi closed form solutions for European option prices when assets are modeled by afhne Gaussian GARCH models. 

Although GARCH models provide a better setting than their homoskedastic alternatives, the Gaussian assump- 
tion is too restrictive to allow the capture of the evolution of the asset returns and the anomalies observed in the 
market option quotes. Therefore, different interrelated extensions have been proposed recently in the literature. 
They can be classified in two main directions. Firstly, new GARCH option pricing models based on a plethora of 
skewed and leptokurtic non-Gaussian innovations have been suggested; for example, the General Error Distribu- 
tion in Duan [33], the Inverse Gaussian in Christoffersen et al. [23J, the Normal Inverse Gaussian in Stentoft [55] . 
the Generalized Hyperbolic in Chorro et al. |20) . and Gaussian mixtures in Badescu et al. |3J or Stentoft and 
Rombouts |69|, among others. Secondly, semi or non-parametric pricing models have also been proposed; see for 
example Badescu and Kulperger [3J and Barone-Adesi et al. [5] . For further recent advances in the GARCH option 
pricing literature we refer to the survey paper by Christoffersen et al. [26] and references therein. 

Since markets described by all these discrete-time models are incomplete, a natural question that needs to be 
answered is the choice of the stochastic discount factor for pricing purposes. Indeed, considering new risk neutral 
measures is a necessity since the LRNVR cannot be applied outside the Gaussian framework, even though it should 
be mentioned that Duan provided a generalization of the LRNVR in Duan (34] using an equilibrium argument and 
invoking conditional bivariate Gaussian assumptions on a function of the asset returns and aggregate consumptions; 
Badescu et al. [5] have shown that this transformation is closely related to a Wang [70] type transform often used 
for pricing insurance contracts in the actuarial science literature. This pricing kernel has been empirically tested 
by several studies that include Christoffersen et al. [22] and Stentoft [68] . 

A good candidate among the class of martingale measures is the conditional Esscher transform that was first 
proposed in a general GARCH setting by Siu et al. [67j and has been further investigated in terms of option prices by 
other authors, such as Badescu and Kulperger [3], and Christoffersen et al. [25]. Gourieroux and Monfort [5U] also 
used this exponentially affine stochastic discount factor for a more general class of discrete-time models. Badescu 
et al. [5\ provided conditions under which this change of measure is consistent with pricing kernels constructed on 
consumption CAPM models. Another option concerning the choice of risk neutral measure was proposed by Elliott 
and Madan [40] as a discrete-time generalization of the Girsanov theorem; this direction has been empirically 
investigated in Badescu et al. [3J. Finally, GARCH option pricing models based on second order conditional 
Esscher transforms and variance dependent kernels have been recently proposed by Monfort and Pegoraro [60] and 
Christoffersen et al. [24 , respectively. Unlike the previously described martingale measures, these require the use 
of option data, due to the presence of extra parameters which are not estimated based only on historical return 
data. 
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Despite their popularity in terms of option pricing, GARCH models have rarely been used for option hedging 
purposes. The first such proposal can be found in the paper by Duan |31| which introduced a delta hedge obtained 
as the partial derivative of the option price with respect to underlying and computes it using Monte Carlo simulation 
for evaluating the corresponding LRNVR expectation. Other approaches for computing this hedge ratio include 
the finite difference method proposed by Engle and Rosenberg 42 or the adjusted Black Scholes delta hedging 
computations based on analytic approximations of GARCH option prices (see Choi [17]). Kallsen and Taqqu [55] 
proposed a continuous time GARCH process under which the associated markets are complete and also investigated 
hedging in the Black-Scholes sense. Duan's [31] definition of the delta hedge has been subjected to several criticisms, 
the most important coming from Garcia and Renault |48] who pointed out that this specification ignores the fact 
that the option price depends also on the conditional variance, which is itself a function of the asset price. In order 
to take into account the stochastic volatility environment of GARCH models, Engle and Rosenberg [43] considered 
the computation of the option's deltas and gammas using the chain rule and studied the modifications in empirical 
performance when hedging options with changing variance term structure for different volatility specifications. 

The techniques briefly described above try to extend the hedging designed for complete markets to a GARCH 
context, and therefore they are not optimal in the incomplete market setting. Moreover, the use of continuous time 
hedging ratios for discrete-time models may generate high hedging errors. Thus, various other hedging techniques 
have been proposed in the financial literature by minimizing different measures for the hedging error, such as 
quadratic functionals. One of the most influential strategies in this direction has been the local risk minimization 
scheme introduced by Follmer, Schweizer, and Sondermann (see [4511471155] . and references therein); this approach 
uses the notion of sequential regressions in order to construct a generalized trading strategy (not necessarily self- 
financing) that minimizes daily squared hedging errors. Local risk minimization is particularly convenient in the 
discrete time setup because it is well adapted to recursive algorithmic implementations and can be tuned to minimize 
hedging errors related to arbitrary prescribed hedging frequencies. The standard approach for constructing local 
risk minimization hedges is to minimize the conditional expected value of the quadratic cost of hedging under the 
historical probability measure. This optimization procedure also gives rise to a contingent claim price computed 
in the risk neutral world with the well-known minimal martingale measure. Although local risk minimization has 
been widely used for pricing and hedging financial derivatives based on continuous time models, its implementation 
in discrete-time, and in particular for GARCH models, raises some issues. For example, in Ortega |62j it has 
been shown that in the GARCH context it is only under very restrictive hypotheses (the model innovations are 
required to be bounded) that option prices arising from local risk minimization using the physical measure are 
guaranteed to be arbitrage free. Ortega also shows that in the absence of those hypotheses, the minimal martingale 
measure, whose existence ensures that the local risk minimizing price is free of arbitrage, is signed. Additionally, 
a number of situations have been identified (see e.g. Cerny and Kallsen |17j ) in which the local minimization of 
squared hedging errors does not guarantee the minimization of the final unconditional expected squared hedging 
error; this point has motivated recent important work which target the minimization of the global hedging risk (see 
e.g. Cerny and Kallsen [17j [TBI US] ) • Therefore, in order to address some of these issues, quadratic hedging with 
respect to a martingale measure has been recently proposed. When the local risk minimization is carried out with 
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respect to a martingale measure, a number of interesting features arise. For example, the option prices that result 
from the scheme arc guaranteed to be arbitrage free and the hedging strategies in this case minimize, not only the 
local but also the global hedging risk. Additionally, there is a decoupling between the expressions that determine 
hedges and prices that can be taken advantage of in order to create numerically performing schemes to compute 
them. Additional advantages of using this approach are discussed in Section |3.1[ an intuitive explanation behind 
risk neutral optimization and references to other research studies which undertake this approach are presented in 
Cont et al. |28j . To our knowledge, the only attempts at exploring local risk minimization in a GARCH setting are 
due to Ortega |62| and Huang and Guo |55j . The former investigates the use of the local risk minimization with 
respect to a martingale measure in the context of Gaussian driven GARCH processes and shows how Duan |31j 
and Heston and Nandi's |5T] pricing formulas can be recovered in this fashion, while the latter investigates the risk 
adjusted expected quadratic cost of hedging minimal hedging strategies in the sense of Elliott and Madan [40] . The 
empirical findings of these studies show that such techniques provide a viable alternative to the standard hedge 
ratios used for GARCH models. 

In this paper we provide a unified framework for the local risk minimization and the standard continuous-time 
delta hedges for a general class of GARCH models. Firstly, we introduce local risk minimization with respect to a 
martingale measure for different hedging frequencies under a GARCH model with an unspecified distribution for 
the driving noise. We present the main advantages of using such a scheme. Although this construction is valid 
for any risk neutral measure, we shall focus on the conditional Esscher transform and on the Extended Girsanov 
Principle for our numerical examples. These two measures are identical and coincide with the one coming from 
Duan's LRNVR [3T] for normally driven GARCH processes, so it is only in the case of non-Gaussian driven models 
where the effect of the risk neutral measure on the hedging performance will be investigated. 

When the time between two consecutive observations approaches zero, we show that both the local risk min- 
imization hedge ratio under a martingale measure and the standard one-step ahead minimum variance GARCH 
hedge taken with respect to the historical measure can be approximated by a stochastic volatility type delta hedge 
computed using a chain rule differentiation for the total derivative. More specifically, the resulting delta consists of 
the sum of the option delta (the first partial derivative of the option price with respect to the spot price) and the 
option vega (the first partial derivative of the option price with respect to the conditional variance process) scaled 
by a so-called vega multiplier (VM). The name of this term is borrowed from Engle and Rosenberg (33]. However, 
this approximation is different from the one proposed by Engle and Rosenberg |43j since all our hedge ratios and 
correction terms are computed directly based on a GARCH model instead of on an adjusted Black-Scholes price in 
the sense of Hull and White [55] . Further derivations for the GARCH delta and vega are computed in the case of 
a Gaussian model with a generalized volatility dynamic, but these quantities require Monte Carlo simulation for 
their evaluation. For a particular case of a Gaussian asymmetric GARCH model which contains a leverage effect, 
we derive the corresponding local risk minimization hedge of its continuous time bivariate diffusion limit taken 
in the sense of Duan [55] . This result may also serve as a further justification of why Duan's [3T] hedge is the 
correct instantaneous delta hedge only when changes in today's return and tomorrow's variance are conditionally 
uncorrelated. However, for the levered Gaussian GARCH model, the corresponding VM is non-zero. Thus an 
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adjustment to Duan's delta hedge has to be made. In this direction, we propose two alternative hedging schemes: 
the first one is based on the continuous time limit of the GARCH VM and the second uses the standard chain 
rule differentiation, so the VM is calculated as the derivative of the variance process with respect to the asset 
price. Our numerical experiments suggest that the local risk minimization and the delta hedge computed based 
on the continuous time approximations outperform the other hedging scheme considered for almost all maturities, 
moneyness and hedging frequencies considered. The performance measure used for our comparison purposes is 
based on the mean square hedging error (MSHE) constructed from replicating European call options. Next, we 
compare the hedging performance of the local risk minimization and Duan's delta hedge when the returns are mod- 
eled with a Normal Inverse Gaussian GARCH model under the conditional Esscher transform and the Extended 
Girsanov Principle. Similar numerical results suggest that there arc no significant differences between the hedging 
schemes when these two pricing measures are used. Additionally, our findings suggest that overall, the local risk 
minimization taken with respect to a martingale measure outperforms Duan's delta hedge for at-the-money and 
out-of-money contracts and in the cases when the portfolio rebalancing is made less frequently. 

The paper is organized as follows. Section [2] characterizes the risk neutral GARCH dynamics for the pricing 
kernel used. Section [3] presents the local risk minimization and its relationship with the continuous time hedging 
schemes. In Section [4] we explicitly compute the delta and the vega hedges for GARCH models and show the 
relationship between local risk minimization under GARCH and its continuous time limit. Numerical experiments 
that compare the performances of the different hedging schemes are provided. Section [5] studies different imple- 
mentations of the local risk minimization hedging scheme for models with Normal Inverse Gaussian innovations 
using the martingale measures which were introduced in Section [2] Section [6] concludes the paper. 

2 GARCH(1,1) models and martingale measures 

Consider a discrete time financial market with the set of trading dates indexed by T = {t\t = 0, . . . , T}. The market 
consists of one reference and one risky asset. We denote by P the underlying probability measure and we assume 
that the dynamics of these assets are modeled by the following bivariate stochastic process (S°, S) = {(5°, St)}t^r- 
The risk-free asset is a deterministic process and it evolves over time according to the following equation: 



Here r represents the constant one-period continuously compounded risk free rate of return. We also denote by St 
the discounted asset price process from time r = t to r = 0, so St = e~ rt St- 

The log-return on the risky asset process, denoted by y — {yt}teT '■= { m (S t / St-i)}t&Ti nas a general 
GARCH(1,1) structure with the P-dynamics given below: 





(2.1) 



y t = m+a t e t , H^Ft-x ~ D(0, 1) 
of = ao +ai0f-iw(e t _i) +/3i<7f_ 1 . 



(2.2) 



(2.3) 



The following assumptions and notations complete our description under P: 
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• The uncertainty modeled through the complete probability space (Q,T,P) has a filtration T :— {Ft}teTi 
where Tt represents the a-field of all market information available up to time t. 

• Tt = cr(ei, . . . , et) where {tt}t£T is a sequence of random variables such that for each t, e t is independent 
and identically distributed conditional on J- t with a standard distribution Z?(0, 1) under P; the two numbers 
in the symbol D(0, 1) indicate that this distribution has mean and variance 1. We denote by /£(•) and 

(•) the J^-i-conditional probability density function and cumulative distribution function of e t under P, 
respectively; we further assume that its conditional cumulant generating function exists in a neighborhood 
of zero: 

nf t (z) = lnB^Je^] < oo, z e (-u,u), u > 0. 

• The conditional mean log-return fj, t is J-t—i measurable. Different mean specifications, similar to the ones 
proposed in Duan |31j or Christoffersen et al [25] . are discussed in the following sections after parametric 
assumptions for the driving noise are made. 

• The conditional variance equation has a GARCH(1,1) structure with the news impact curve modeled through 
the function w(-). The GARCH parameters are assumed to satisfy the standard second order stationarity 
conditions. 

The following lemma summarizes two results later used in this paper for constructing locally risk minimizing 



hedging strategies under P. The proof involves basic algebraic manipulations of conditional moments of (2.2) and 
is therefore omitted. 

Lemma 2.1 The physical mean and variance of changes in discounted asset prices are given by: 



E 1 



Var p 



St — St-i I Tt—\ 
St — St-i I Ft-i 



= e^- rt S t -i (e< (fft) - e r -^) , (2.4) 
= e^-^S^ \e<^ - e a < («•«)] . ( 2 .5) 



It is well known that, in the absence of arbitrage, the price of any contingent claim can be expressed as the 
discounted expected value of its payoff at maturity under a P - equivalent risk neutral probability measure. The 
discrete-time structure of our model makes the market incomplete and, therefore, there exist an infinite number 
of such pricing measures. In order to complete the characterization of our GARCH(1,1) pricing model we need to 
specify the risk neutralized dynamic of the log-returns under such a measure. Thus, we propose below the following 
general GARCH(l,l)-type structure for the log-return process under a probability martingale measure generically 
denoted by Q: 

Vt = r-K%(o-* t ) + o-;e* t , e t *|J- t _! ~ D* (0,1) (2.6) 
<r* 2 = aS t + a lt a?V*(Ci) + £W-i- (2-7) 



The above equations require some explanation and further comments as follows: 
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• All risk neutral quantities are illustrated through a "*" notation, so that of 2 represents the risk neutral 
conditional variance of the log-return process, while k^(-) is the Tt-\- conditional cumulant generating 
function of e\ under Q. 

• The GARCH - type parameters are no longer constants under the new measure. Instead, in order to keep 
the predictability structure of the conditional variance under Q, we assume that ag t , ot\ t and /3* t are all T%—\ 
measurable. 

• The standard innovation process under the martingale measure is assumed to have a conditional distribution 
under the martingale measure D*(0,1) different from D(0, 1). However, in many practical examples it is 
often the case that D* and D coincide. 



We can now formulate the results from Lemma |2.1| in the risk neutral case. The proof follows immediately by 
replacing the risk neutral return specifications into Lemma |2.1| . 



(2.8) 
(2.9) 



Lemma 2.2 The risk neutral mean and variance of changes in discounted asset prices are given by: 



E Q 
Var Q 



St — St-i I J- t-i 
St — St-i I Tt-\ 



= o, 



-2r(t-l) c2 

°t-l 



K« (2<7 t *)-2 K « (a*) 

e t 't — l 



Note that equation (2.8) is a direct consequence of the fact that discounted asset prices are martingales under Q. 



The dynamics from (2.6)-(2.7) provide us with a very general GARCH characterization in the risk neutral world. 



This structure nests some of the most popular martingale measures used in the GARCH option pricing literature. 
In the remainder of this section we show that both the conditional Esscher transform and the Extended Girsanov 
Principle give rise to risk neutral dynamics which are special cases of our proposed model. 



2.1 Extended Girsanov Principle 

An extended Girsanov principle has been proposed by Elliott and Madan [40] as an alternative pricing measure 
for discrete time models. As its name suggests, the construction of this measure is based on an extension of the 
Girsanov transformation to non-Gaussian settings according to which asset prices will have a different mean return 
and an unchanged volatility after the change of measure. This measure has been used for option pricing by Badescu 
and Kulperger [3] in a semiparametric GARCH setting and Badescu et al [I] for Gaussian mixture GARCH models. 

Although the Radon-Nikodym derivative is defined in terms of the martingale component process from the 
multiplicative Doob decomposition of the discounted asset price, we follow Badescu et al [5] and characterize the 
risk neutralized dynamic for our proposed generalized GARCH setup. The resulting equations can be viewed as a 



special case of ( 2.6 )-( 2.7 1 



Proposition 2.3 Define the stochastic process N := {N t }teT : 



N t = 



/t P (et + gt) 
ffiet) : 
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Here {fft}teT * s an -Ft -predictable process representing the market price of risk given by: 



Qt = — . (2.10) 



Then, the following statements hold: 



t 

(i) The process Z := {Z t }teT defined by Z t '■= Yi with Zq — 1 is a P -martingale and Zt defines an equivalent 

fe=i 

measure Q by Zt = jp ■ 

(ii) The innovation process e* := {ej}tgT with el = e t + Qt is a sequence of Tt-\- conditionally uncorrelated zero 

mean and unit variance random variables under Q. 

(iii) The risk neutralized asset return dynamic is given by: 

Vt = r-K^(a t )+a t e* t , e*~D(0,l), 
o-\ = a + a 1 at_ 1 oj{e* t _ 1 - g t -i) + $\<j\_ x . 

In order to differentiate among the risk neutral measures used throughout this paper, we further denote the measure 
associated to the Extended Girsanov transform by Q egp - For completeness, the proof of this proposition is provided 
in the Appendix. 



It is not difficult to see that the above dynamic is a particular case of ( 2.6 )-( 2.7 ) when the risk neutral innovation 
is shifted by the risk premium process and the conditional variance under P remains unchanged after the measure 
change (i.e. at = cr^). Although in our numerical simulations we use special cases of this result, this risk neutralized 
dynamic can be applied for pricing and hedging under GARCH models for any distribution which satisfy the 
constraints imposed of the beginning of the section. 

Remark 2.4 An important characteristic of the Extended Girsanov Principle is its relationship to the notion of 
locally risk minimizing hedging. Elliott and Madan [40] showed that this new martingale measure is the unique 
probability measure which is consistent with hedging strategies that minimize the one-step ahead quadratic risk 



adjusted cost of hedging. Further comments on such strategies are made at the end of Section 3.1 after we introduce 
the notion of locally risk minimizing strategies. 

2.2 Conditional Esscher transform 

The conditional Esscher transform has been hrst introduced into a GARCH setting by Siu et al. [ST] . Since then, this 
transformation has been used in a variety of empirical studies which dealt with the pricing of financial derivatives 
under different assumptions regarding the innovation process. For example, Badescu et al. 0] and Stentoft and 
Rombouts 69i used it for pricing both European and American style options under Gaussian mixture innovations. 
This method has been also investigated and further generalized by Christoffcrscn et al. . The change of measure 
is based on an exponential affine stochastic discount factor and it has become a very popular tool for many of the 
GARCH option pricing models due to its Radon-Nikodym derivative tractability. Another appealing feature of this 
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change of measure is the fact that it preserves the parametric driving noise distribution family after the change 
of measure. For example, Siu et al |67j and Christoffersen et al |25| derived such a result for infinite divisible 
distributions. 

In the following proposition we characterize the risk neutralized dynamic of the asset return process given in 



(2.2)-(2.3) under the conditional Esscher transform and we show this is a special case of the general family of 



GARCH pricing models from (|2.6[)-(|2.7 1. 

Proposition 2.5 Define the stochastic process N :— {N t }teT : 

Here {0t}teT * s a -Ft -predictable process which satisfies the following martingale condition: 

fit + < ((1 + 6 t )a t ) - < (e t a t ) = r. (2.11) 

Then, the following statements hold: 

t 

(i) The process Z := {Z t }teT defined by Z t :— Yi with Zq — 1 is a P -martingale and Zt defines an equivalent 

fc=i 

measure Q by Zt = jp ■ 

(ii) The innovation process e* := {e* t } t eT defined by: 

7<"(^t) ' 

is a sequence of Tt~\- conditionally uncorrelated zero mean and unit variance random variables under Q. 
Here k P/ and k p " represent the first and second order derivatives of the corresponding cumulant generating 
functions. 

(iii) The risk neutralized asset return dynamic is given by: 

Vt = r-K%{a* t )+a* t c* tl e*~D*(0,l), 
o-f = a* ot + a> t *V«'(M + ^ P :'{6 t Ot)eU) + P>t-v 
Here the conditional variance dynamic coefficients are given by: 

a ot = a oKet (9 t a t ), a u = a, ^—-^ , [3 U = ^ ■ 

Although some proofs of parts of the above theorems are already given in a less general context in some other 
papers, for completeness we provide a complete proof of the theorem in the Appendix. We denote by Q ess the 
change of measure induced by the conditional Esscher transform. It is easy to note that the above risk neutral 
specification is again a special case of the general risk-neutralized GARCH model introduced at the beginning of 
the section. Unlike the Extended Girsanov Principle, the conditional Esscher transform gives rise to a different risk 
neutral conditional variance. For example, using a second order Taylor expansion for the second order derivative 
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of the cumulant generating function Kf t (0 t a t ) around zero, Christoffersen et al |25) show that the risk neutral 
conditional variance will be always larger than its physical counterpart provided that the conditional skewness and 
excess kurtosis are negative and positive, respectively, which is a common stylized fact exhibited by most financial 
time series. A higher implied volatility, computed from option prices, than the physical volatility is a well-known 
empirical finding in the option pricing literature and gives rise to the necessity of incorporating a volatility risk 
premium. Note that the risk neutral dynamic no longer has a standard GARCH structure since the parameters 
apt, a* t and j3* t are F t predictable processes, which makes it a special case of the proposed GARCH risk neutral 



equations (2.6)-(2.7). 

A second implication of the above result is that Esscher transforms does not preserve the innovation distribution 
under the change of measure (i.e. D* ^ D). However, this is not be the case for our numerical simulations which 
are based on Gaussian and Normal Inverse Gaussian (NIG) GARCH models. 

Finally, the process 8 t is a unique solution of the martingale condition in (ii), which uniquely characterizes 



the Esscher transform. An explicit solution of (2.111 is rarely available and solving numerically for 9 t can be 
computationally demanding for an option pricing exercise, thus making the implementation of the local risk min- 
imization hedging scheme not feasible. An alternative solution is to use a Taylor expansion around the origin for 



both cumulant generating functions in (2.11) in order to obtain an approximation. Such a strategy leads to the 
following approximate solution: 

*>^4 (2-12) 
The above approximation coincides with the exact solution when innovations are Gaussian and in this case the risk 
neutral variance equals the physical one. The advantage of using NIG innovations is twofold: firstly, we specify a 
skewed and leptokurtic density which fits the returns better than the Gaussian one and secondly, the conditional 
risk neutral variance under the Esscher transform is different from that given by the Extended Girsanov Principle. 
Moreover, in this setup we have an explicit solution for 9 t which we provide in Section [5] 



3 Risk minimization for GARCH options 

The market associated with GARCH modeled assets is intrinsically incomplete, that is, there exist contingent 
products H that cannot be fully replicated using a self-financing portfolio made out of its underlying and a bond. 
A number of techniques have been developed over the years that tackle simultaneously the pricing and hedging 
problems by replacing the concept of replication with that of hedging efficiency. As we shall see, these techniques 
provide simultaneous expressions for prices and hedging ratios which, even though in most cases they require Monte 
Carlo computations, admit convenient interpretations based on the notion of hedging error minimization that can 
be adapted to various models for the underlying asset. 

In order to provide a brief account of the hedging techniques that we shall be using we start by introducing 
several definitions. 

As we have done so far, we will denote by S t the price of the underlying asset at time t. The symbol r denotes 
the constant and continuously composed risk-free interest rate paid on the currency of the underlying in the period 
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that goes from an arbitrary time step t — 1 to the next t. The price at time t of the riskless asset S such that 
5g = 1, is hence given by S® = e rt . 

Let now H(St) be a European contingent claim that depends on the terminal value of the risky asset St- As 
we already pointed out, in the context of an incomplete market, it will be in general impossible to replicate the 
payoff H by using a self-financing portfolio. Therefore, we introduce the notion of generalized trading strategy, 
in which the possibility of additional investment in the riskless asset throughout the trading periods up to ex- 
piry time T is allowed. All the following statements are made with respect to a fixed filtered probability space 
{Q,P,J 7 ,{J 7 t}te{o,....T}), with P being this time around not the physical probability but an arbitrary probability 
measure. 

• A generalized trading strategy is a pair of stochastic processes such that {£?}te{o,...,T} is adapted and 
{£t}te{i,...,T} is predictable. The associated value process V of is defined as 

V :=$, and V t := &° ■ S° + & • S t , t > 1. 

• The gains process G of the generalized trading strategy £) is given by 

t 

G :=0 and G t := ^ & ■ (S k ~ Sfc-i), t = 0,...,T. 

• The cost process C is defined by the difference 

Cf.= V t -G t , t = 0,...,T. 

These processes have discounted versions V t , G t , and C t defined as: 

t 

S t :=S t e- rt , V t :=V t e- rt , G t := ^6 • (S k - S k -i), and C t := V t - G t . 

k=l 

Assume now that both H and the {St}te{o,....T} arc i n L 2 (Q, P). A generalized trading strategy is called admissible 
for H whenever it is in L 2 (£l, P) and its associated value process is such that 

V T = H, P a.s., V t ,G t e L 2 (n, P), for each t. 

In the context of quadratic criteria, hedging efficiency has been mainly handled in the literature by finding strategies 
that minimize one of the three following risk processes: 

(i) Local risk process: LR t (£° , £) := E p \{C t +i - C t ) 2 \ J 7 *] , t = 0, . . . , T - 1. 

(ii) Remaining conditional risk: RR t (Z°,Z) ■= E P [(C T -C t ) 2 \ T t ] = E P [(H -J^Lt+i^- (S k -S k ^)-V t ) 2 | J" t ], 

t = 0,...,T-l. 

(iii) Global risk: GR(?,Q := i? p [(C T - C ) 2 ] = ELi^ ' - § fe _i) - V ) 2 }. 

We summarize below several reasons why the local risk minimization has been probably the most influential scheme 
in the literature so far: 
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• Availability of a convenient characterization of local risk-minimizing strategies and of an easy to implement 
recursive algorithm to compute them: indeed, it can be shown that an admissible strategy is locally risk- 
minimizing if and only if the discounted cost process is a P-martingale and it is strongly orthogonal to S, in 
the sense that Cov p (S t +i — S t , Ct+i ~ Ct \ T t ) = 0, P-a.s., for any t = 0, . . . , T — 1. Moreover, whenever a 
local risk-minimizing technique exists, it is fully determined by the backwards recursions: 



Vi 



H. 

Cov p (V t+ i, S t+1 -S t \T t ) 
Var p [S t+1 -S t \T t ] 



(V t+1 - V t )(S, 



t+1 



S t ) I T 



E p V t+ i | T t 



E p 
St\T t 



St+i — St I Tt 



(3.1) 
(3.2) 

(3.3) 



The initial investment Vq determined by these relations is referred to as the local risk minimization option 
price associated to the measure P. 

• The minimization of the local risk implies in many situations the minimization of the global and the remaining 
risks: for example when P is a martingale measure, local risk minimizing strategies are variance optimal (that 
is, they minimize the global risk; see |44l Proposition 10.37])) and minimize also the remaining conditional 
risk (see [3H Proposition 10.34]); ; this is in general not true outside the martingale framework (see [SrJl 
Proposition 3.1] for a counterexample). Outside the martingale context things are more complicated and 
there are in general only sufficient conditions that allow the construction of global risk minimizing strategies 
out of local ones; these hypothesis are generically not satisfied in the GARCH context; see for example [4"4l 
Theorem 10.40]. 

• Local risk minimization can be adapted to prescribed changes in the hedging frequency: suppose that the 
life of the option H with maturity in T time steps is partitioned into identical time intervals of duration 
j; this assumption implies the existence of an integer k such that kj — T. We now wish to set up a local 
risk minimizing replication strategy for H in which hedging is carried out once every j time steps. We shall 
denote by ^t+j the hedging ratio at time t that presupposes that the next hedging will take place at time 
t + j. The value of such ratios will be obtained by minimizing the j-spaced local risk process: 



LR^ (f, := E p {Cf> - Cf'f \T t , t = 0, j, 2j, . . . , (k — l)j = T — j 



(3.4) 



Here {C^} is a cost process constructed out of value and gains processes, {v}^} and |g^| that only take 



into account the prices of the underlying assets at time steps t = 0,j,2j, 
integer I such that t = Ij 



, kj = T, in particular, given an 



G 



U) 



i 



/ j £rj ' (Srj 



s. 



(r-l)j 



The solution of the minimization problem for (3.4) is the same as (3.1)-(3.3) with the symbols t+1 replaced 



by t+j and the variable t taking the values t = Q,j, 2j, . . . , (k—l)j = T—j. A convoluted but straightforward 
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induction argument shows that the solution of these recursions is: 



6+j 



= H, 



Gov p (e~ rI H ■ Nt ■ N T -j ■ ■ ■ N 



t+2j , S t +j 



s t I K 



Vai p [S t+j -S t | Ft] 
E P [e- rT H ■ N T ■ NT-j ■ ■ ■ N t+j ] , f = 0, j, 2j,...,T — j, 



(3.5) 
(3.6) 

(3.7) 



Here, for any t = j, 2j, . . . , T, the process N t is given by 
1 - lt{S t - St-j) 



N t = 



E p 



where, 



1 - lt{S t - S t -j) | Tt- d 



It 



= 1 - 



E p 


St — St-j | J- t-j 




[S t -E p 


S t | T t -j 


) 


Var p 


St - St-j T t-j 





(3.8) 



St — St-j \T t --, 



E p 



{S t -S t -jY\T t -j\ 

When the product Zt := Yite{j 2j t} is a positive random variable, the relation Zt '■= dP / dQ m m defines 
a measure equivalent to P, called the minimal martingale measure, for which the discounted partial process 
{Sj, S^j, ■ ■ • , St} is a martingale. The interest of this measure is that, when it exists and in the case j = 1, 
the value process of the local risk-minimizing strategy with respect to the physical measure coincides with 



arbitrage free price for H obtained by using Q m i n as a pricing kernel. Indeed, (3.6| can be rewritten as 

H(S T ) ■ N T ■ N T -j ■ ■ ■ N t+j | F t ] = E^" \e^ T ^H(S T ) | Tt ■ 



E 1 



-r(T-t) 



In a recent paper, Basak and Chabakauri |10| derive the discrete time mean correcting martingale measure as 
their hedging neutral measure by solving a standard minimum variance problem with a budget constraint. The 



optimal ratio from (3.6) corresponds to the optimal hedge per unit asset ratio from Proposition 3 in Basak and 
Chabakauri |10) . However, their hedging formula is slightly different since it is derived based on a budget constraint, 
which does not hold in the local risk minimization setting. 



Although pricing and hedging using the recursion formulas (3.5)-(3.7) might look straightforward, there are 
various implementation issues that need to be taken care of when asset returns are modeled by GARCH processes; 
we analyze in detail some of them in the next section and we propose an alternative hedging scheme. 



3.1 Local risk minimization with respect to a martingale measure 

The minimal martingale measure is usually a signed measure in discrete time settings, so derivative valuation 



based on this approach might not be suitable. For example, the Radon-Nikodym process defined in (3.8) can take 



negative values and this may lead to negative prices when computing the expectations from (3.7) using Monte-Carlo 
simulation. For asset returns governed by GARCH models, one can impose conditions on the parameter set or 
the innovations which can be too restrictive and make the estimation more challenging. A sufficient condition in 
the GARCH context requires the driving noise to be bounded (see for example 62, Proposition 2.6]). However, 
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an unreported numerical simulation study indicates that negative values for N t happen very rarely for GARCH 
processes driven by Gaussian innovations. This will no longer be the case if a variety of other heavier tailed densities 
are used, and one of the objective of the current paper is to explore hedging strategies based on such models. 



The recursive derivation of a derivative price according to ( 3.5 ) - ( 3.7 ) is not an easy task even in settings when 
the minimal martingale measure is a probability measure. Both the option prices and hedging ratios computations 
require the use of Monte Carlo simulations under the physical measure P and the standard filtration {J-t} generated 
by the price process, which in the GARCH setup coincides with the one generated by the driving noise, that is 



Tt = a (ei, . . . , et). An inspection of the expressions (3.5 ) (|377[) shows that in order to compute them we need first 



an estimation of the stochastic discount factors {Nj, . . . , Nt}- When j — 1, these factors can be written down in 



terms of the conditional cumulant generating function; indeed, using (2.4) and (2.5) in (3.8), we obtain: 

( e <(^) _ e r-iH \ _ e < (<Tt) ) 
N t = 1 - t — c -. 

However, when j > 1 this estimation is necessarily conducted using Monte Carlo simulations whose results are 



in turn substituted in the expressions ( 3.5 H 3.7 1 that are also evaluated using Monte Carlo. The variance of the 
resulting estimator is too high and makes the results unacceptable. Moreover, the option prices that result from 
this technique cannot be interpreted as arbitrage free prices if the hedging frequency is lower than periods at 
which asset prices are observed. Indeed, in the case j > 1, the resulting measure is a martingale measure for the 
discounted partial process {Sj, S2j, ■ ■ • , St} but not for the entire discounted price process {Si, S2, ■ ■ ■ , St} and 
therefore there is a leeway for arbitrage. 

As described in Basak and Chabakauri |10j , another potential problem with variance minimizing hedging strate- 
gies is their time-inconsistency. These authors argue that such inconsistencies can be avoided by either considering 
that the market is complete or by assuming a zero risk premium for the asset return. The former does clearly not 
fit our GARCH setting while the latter is equivalent to the discounted asset price being a martingale under the 
physical measure; hence, one way to overcome the time-inconsistency problem is to consider hedging directly under 
a risk neutral measure. 

The reasons that we mentioned above lead us to explore the local risk minimization strategy for martingale 
measures, despite the fact that the historical probability associated with the price process represents a more 
natural approach from a risk management perspective. More specifically, we shall focus on the martingale measures 
introduced in Section [2] Additionally, given that the trend terms that separate the physical measure from being a 
martingale measure are usually very small when dealing with daily or weekly financial returns, it is expected that 
the inaccuracy committed by using a conveniently chosen martingale measure will be smaller than the numerical 
error that we would face in the Monte Carlo evaluation of the expressions corresponding to the physical measure; 
see [621 Proposition 3.2] for an argument in this direction. Thus, following Fdllmer and Sondermann 05], we derive 
recursive pricing and hedging relations using risk minimization under martingale measures. These are obtained in 



a straightforward manner using expressions (3.5)-(3.7) combined with the martingale hypothesis in its statement 



that makes the stochastic discount factors (3.8) {Nt} t £{j,2j,...,T} automatically all equal to 1 
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Proposition 3.1 Let Q be an equivalent martingale measure for the price process {S t } and let H(St) be a European 
contingent claim that depends on the terminal value of the risky asset St ■ The locally risk minimizing strategy every 
j time steps with respect to the measure Q is determined by the recursions: 



Vn 



H. 



EQ [H(S t ) {St +3 e- rj - S t ) | Ft] 



V t U} = E Q 



Var y [St+je- r i 
^H(S T ) I T t 



S t | H 



(3.9) 
(3.10) 

(3.11) 



for all t = 0, j, 2j, ...,T-j. 



For daily hedging (j = 1), the denominator of (3.10) can be explicitly written down using Lemma 2.2 in terms of 
the ^-conditional cumulant generating function nft* of ejfunder Q and of the risk neutral conditional variance o~* t 
as 



Var Q [e~ r S t+1 - S t \ T t ] = S; 



««.(2<T t * + i)-2K^.K +1 ) 

e t t 



1 



For lower hedging frequencies (j > 1) this denominator needs to be evaluated by using the Monte Carlo estimator 
for the variance, that is, 

\ 2 



1 - 

Var« [e' r S t+1 - S t \ F t ] * ^ £ (S? +J -( 

i=l 



, S^_j are N realizations of the price process under Q at time t + j, using paths that have all the 



where Sj + j, 
same origin St at time t. 

The recursion formulas require some explanation. First, local risk minimization under a martingale measure 
does not directly determine the arbitrage free price as in the historical probability optimization case. The option 



price is, a priori, established by assuming that the discounted value process is a martingale under Q; (3.11 1 serves 



as a verification for this fact. The value process V t (j) is the same for any j, so it does not depend on the hedging 
frequency. Vo represents the necessary initial investment to setup the local minimizing strategy to replicate H. 
This coincides with the arbitrage free pricing for the payoff H and it strongly depends on the choice of the pricing 
kernel Q. 



Secondly, the hedging ratios in (3.101 can be computed using standard Monte Carlo techniques by simulating 



asset price paths under the risk neutral measure considered. In cases where simulation under Q is not very 



convenient, one can compute both the numerator and the denominator of (3.10) by simulating historical paths and 



make use of the corresponding Radon-Nikodym derivative. More precisely, if the change of measure is defined by, 
dQ/dP = nLi N k, th en for any Tt measurable function / and any t G {0, . . . , T}, 



Eft [f | Ft] = E 1 



f n n "\^ 
. fe=t+i 



Note that if Q is chosen to be the minimal martingale measure, the discounted value processes from (3.7) and (3.11 1 



coincides, while their corresponding hedging ratios are different. When Q is the pricing measure given by the Ex- 



tended Girsanov Principle, the hedging scheme from (3.1 1 is the same as the one obtained in Elliott and Madan jJDj 
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by minimizing the expected discounted risk adjusted squared costs of hedging under the historical measure. A de- 
tailed discussion about this is presented in Huang and Guo [541155] who constructed numerically efficient algorithms 
that implement this approach to pricing and hedging. Their method, known as LESRAM (Local Expected Squared 
Risk Minimizing Hedging), takes advantage of the decoupling between the computation of option values and hedges 



that one obtains in (3.9)-(3.11 ) by using a martingale measure, in order to interpret the hedges as the coefficients 
of a regression of the discounted option prices on the discounted prices of the underlying and the bond. Addition- 
ally, they introduced a semiparametric valuation method (DSA, Dynamic Semiparametric Approach) in order to 
evaluate the explained variable of this regression as accurately as possible. 



Finally, the optimal hedge ratios from (3.101 minimize the conditional remaining risk and the global risk, since 
these criteria are equivalent under a martingale measure. Thus, these strategies are also variance optimal. 

The empirical studies conducted in the next sections provide a comparison between the hedging strategy per- 
formance of the two risk neutral measures described in Section [2] for GARCH models driven by Gaussian and 
non-Gaussian innovations. The remainder of this section provides of brief discussion between the relationship 
between local risk minimization and standard delta neutral hedging ratios from continuous time models. 



3.2 Local risk minimization and delta hedging 

A particularly attractive feature of risk minimization schemes is that the choice of a specific pricing kernel deter- 
mines simultaneously, and in a consistent way, prices and associated hedging strategies. In this section we focus on 
continuous time hedging schemes and illustrate their relationship to the discrete time hedges introduced before. A 
standard approach is to consider a delta neutral portfolio made out of the underlying and the contingent product 
itself and identify the quantity which should be invested into the asset price such that the instantaneous variance of 
the portfolio is minimized. These strategies are called minimum variance hedge ratios and have been investigated 
in both complete and incomplete settings (see for example Boyle and Emanuel [H] , Bakshi at al. [7] and Alexander 
and Nogueira PQ among others). 

Weak convergence of hedging strategies has been another topic of interest in the mathematical finance liter- 
ature. In the context of quadratic hedging, Prigent and Scaillet [M] investigate the weak convergence of local 
risk minimization strategies to the standard delta hedges given by the partial derivatives of the option price with 
respect to the underlying. In particular, they showed such a convergence result when the asset price is modeled 
by a stochastic volatility model driven by uncorrelated Brownian motion. A key ingredient in their derivation is 
provided by the Markovian structure of the model, which is only the case for a GARCH(1,1) setup. However, 
such a result is beyond the aim of our paper. Instead, in the following proposition, we characterize the instanta- 
neous minimum variance delta hedge in a GARCH environment, and we show its relationship with the local risk 
minimization hedge. 

Proposition 3.2 Let H be a European contingent product on the underlying price process {S f } te j- that has a 
GARCH model as generating process. LetQ be the risk neutral measure and H t = II(St, ofi i(St)) = E® \e r ^ T ~ t ^H \ J- t ~\ 
be its associated price. Construct a portfolio consisting of a long position on a unit of the derivative product and 
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a short one on A f units of the underlying. If the time between two consecutive observations approaches zero, the 
following relationships hold: 

(i) The minimum variance delta hedge is approximately given by: 

A t « Af + Cov^(5 ;+1 _5 t ,^ |^) ^ + 

Var- p [S , t+1 -S t | F t ] dS t 

Here Af = is t/ie standard option delta and Af = ^ffi is £/ie vega o/ t/ie option and the coefficients to 
Af m &o£/i approximations are called the vega multipliers (VM). 



(ii) The approximation (3.12) also holds for the Q local risk minimization hedge £ t : 

6+i*Af+ \ Qr „ ' i J tj Af^Af + -^Af. (3.13) 



Vax«[S t+1 -S t | Ji] dS, 



Hedging under GARCH models is typically done using the standard definition proposed by Duan [3T] , according 
to which the hedge ratio is the partial derivative of the option price with respect to the underlying denoted by Af . 
Garcia and Renault [48] point out that this specification ignores the fact that the option price depends also on the 



conditional variance which is itself a function of the asset price. The second approximation in (3.12) is a direct 
consequence of this since it resembles the total derivative of the option price with respect to the asset price. Note 
that this formula resembles the chain rule formulation used in Engle and Rosenberg |43j . but there are two major 
differences. First, our option delta and vega are computed directly based on the GARCH option price, while Engle 
and Rosenberg [43] used a Hull and White 56 Black-Scholes plug-in type of formula for the option price. Secondly, 
their option price is based on the average expected volatility, while ours uses the current value of the volatility as 
an argument, and thus gives rise to a different specifications of the VM. In the case in which the volatility does 
not depend on the underlying, the formula reduces to the standard delta hedge used in Duan |31j . Garcia and 
Renault [48] showed that when the option price is homogeneous of degree one with respect to the asset price and 
the strike price, then the option delta hedge is consistent with Duan's formulation. Further detailed discussions on 
homogeneity of option prices and delta hedging for scale-invariant models are provided in Bates [11] and Alexander 
and Nogueira [U |2] • 



The first approximation in (3.121 suggests that the minimum variance hedge is consistent with Duan's delta 
hedge when changes in today's asset values are positively correlated to changes in the next period's conditional 
variance. We further notice that the first specification for At depends on the probability measure under which 
the portfolio variance is minimized, while the second expression can be obtained under any probability space. A 
more detailed discussion on these two approximations and their relationship with the corresponding continuous 
time stochastic volatility limits for Gaussian driven GARCH models is provided in the Section |4| 

The second part of Proposition |3.2| states that the local risk minimization hedge under a risk neutral measure 



Q converges to the total derivative expression from (3.12| when the time between two consecutive observations 
shrinks to zero. Moreover, the first approximation in (i) should also hold for the P - local risk minimization hedge, 
but only in the special case when the contingent claim price is computed based on the minimal martingale measure. 
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Huang and Guo |55j showed that the local risk minimization taken with respect to the risk neutral provided by 
the Extended Girsanov Principle converges to the Black Scholes delta hedge when the underlying is governed by 
a Geometric Brownian motion. This result can be viewed as a special case of (ii). There are situations (see El 
Karoui et al. [39], Poulsen et al. |63| ) in the continuous time setup and in the presence of a well defined minimal 



martingale measure, in which a volatility adjusted hedging ratio similar to (3.12) can be obtained out of local 
risk minimization, that is, without assumptions on the pricing kernel and by constructing a hedging portfolio that 
involves trading exclusively in the underlying asset and the riskless asset. 

In the next two sections we shall investigate the empirical performance of the local risk minimization hedges 
and compare them with the instantaneous delta hedges that we described above for European call options. 

4 Hedging Gaussian GARCH options 

The goal of this section is to provide a theoretical and empirical analysis of the hedging schemes previously proposed 
for a general class of GARCH(1,1) models driven by Gaussian innovations. Consider then a GARCH(1,1) model 



as in (2.2 1 and (2.3) with a conditional mean prescription given by: 

Ht :^r + Xa t - <(<r t ), Ael. (4.1) 

The main reason for choosing such a parametrization for the conditional mean return is that one can interpret A 
as the market price of risk per unit volatility. Since, we also assume that the conditional distribution D(0, 1) of 
the innovations is normal we have that n^ t {(Jt) = of /2 and hence the return dynamics under the historical measure 
are identical to those in Duan |31j . namely: 

Vt = r + \(T t -^a 2 t +a t e u e t ~N(0,l), (4.2) 
of = a + a^a^ulet-i) + $\<t\_x- ( 4 - 3 ) 

The Gaussian hypothesis makes identical the martingale measures Q egp and Q ess introduced in Section [2] so 
through this section we shall write just Q. Moreover, the specific conditional mean prescription chosen in ( |4.1| 



implies that the unit risk premium g t in (2.10) is constant and equal to A, that is, Qt = A; this simplifies many of 
the computations that follow in this section. Then, under either of the martingale measures Q (that is, Q egp or 
Qess) we can write: 

2 

Vt = r-^+a t e* t , eJ~N(0,l), (4.4) 
of = a + a 1 (Tt_ 1 uj(e* t _ 1 - A) + $\(%__ v (4.5) 

This specification coincides to the one obtained by Duan [31] by applying the local risk neutral valuation principle. 
We now write down two expressions that we shall need in deriving the vega of a European call option that uses 



this GARCH model for the underlying asset. First, using (4.4)-(4.5), the time evolution of the prices under Q is 
given by 

S T = S' t e r(T ~* ) ~5 E < r =*+ 1 ' Ti2+i: < r = t + 1<T!e! * . (4.6) 
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Secondly, we let 



Pi := a 1 Lo(e* - A) + i = 0,...,T. 



A straightforward inductive argument using (4.5 1 implies that 



o\ = oc A(s,l) +o?B(s,l) with 1 = 0, ...T-l and s > I, 



and 



A(s,l) :=n^ 



s-1 



and B(s,0:=n Pi ' 



(4.7) 



The delta and vega of an European call option based on the Gaussian GARCH model are computed below. 



Proposition 4.1 Given a European call option with strike price K that has \4.2)-\4.S) as log-returns generating 
process, the corresponding delta and vega hedges are given by: 

St ^ 



e ~r(T-t) E Q 
e -r(T-t) 



S t 



L {St>K} 



-Efl 



fT-t 



vZ = l 



\^ i -l)B{t + l,t+l) ] 1 {St >k } I T t 



(4.8) 
(4.9) 



Here B(-, •) is given as in (4-1) with the convention B(t + l,t + 1) := 1. 



Equation (4.8) resembles the formula derived in Duan [31] for Gaussian GARCH. We provide an alternative proof 
of this in the appendix and we argue that this relationship also holds for any type of asset return process with 



a risk-neutralization given as in (2.6)-(2.7). Formula (4.9) gives a representation for the option vega; a similar 
expression for the vega of a European call was given in Duan [35] for an asymmetric GARCH model, but without 
proof or numerical implementation. We provide a detailed proof for this expression in the appendix. Since there 
are no closed form results available for the conditional expectations in the thesis of Proposition |4.1[ we use Monte 
Carlo simulations for evaluating them. 

In order to compute the minimum variance hedge we only need to compute the correction terms in the two 



approximations in (3.12). We shall do so later by making specific assumptions about the form of the function 



uj(-) in (4.3); at this stage, we only write down the expression for the derivative of the conditional variance 
with respect to the asset price since no requirements on the volatility specification are required. Indeed, since 
St = St-ie r+X(Jt ~^ a ' t +<Tt£t , we solve for e t and we obtain 



1 

e* = — 
0t 



log(St) - log(5 t _0 - r - Atr t + 



If we substitute this expression into (|4.3|) we can easily compute the VM using the chain rule and obtain: 

a\(Tt dio (e t ) 



dS t 



S t dt t 



(4.10) 



The minimum variance hedge is then obtained by substituting (4.8), (4.9) and (4.10) into (3.12). The empirical 
performance of this hedging ratio is tested in a numerical experiment at the end of this section. In the next 
subsection, we propose a different formula for the minimum variance hedge using an alternative approach based 



on a continuous time limit for the correction term used in the first approximation in (3.12). 
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4.1 Local risk minimization in stochastic volatility GARCH models 

In this section we investigate the local risk minimization hedging strategy within the class of stochastic volatility 
models which are obtained as weak limits of Gaussian GARCH models and study its relationship to the minimum 
variance hedge that we derived in the previous section. The weak convergence of GARCH models to bivariate 
diffusions where the volatility process and the price process are uncorrelated was first studied by Nelson [pTj . 
Following this approach Duan [33] studied the continuous time limit of a more general class of augmented GARCH 
models, which includes the most commonly used GARCH prescriptions in the literature. The convergence results are 
typically based on a set of non-unique assumptions on the model parameters. Non-uniqueness leads to the existence 
of other diffusion limits in the literature (see for example Corradi Heston and Nandi [5T] or Kluppelberg et 
al. [59] among others). Since the bivariate diffusion limits of GARCH models proposed by Nelson [61] and Duan [33] 
are of widespread use for pricing and hedging derivatives in the continuous time setting, we restrict our attention 
to those. 

In order to emphasize the effect of the correlation between the the price process and the variance process 
on the minimum variance hedging, we investigate the continuous limit of a GARCH model that incorporates a 
leverage effect. More specifically, we consider the class of the asymmetric NGARCH models, which corresponds 



to choosing uj{e t ) = (e< — 7) in (4.3), where 7 is the leverage parameter. Following Duan |33j . we first consider a 



finer discretization of the model from ( 4.2 )-( 4.3 ) by dividing the interval [0, . . . ,T] into nT equally spaced points 
of length h — 1/n and indexed by k, k = 1, 2, • • • , nT. The IID sequence of standard Gaussian driving innovations 
is denoted by e k for any fc = 1,2,..., nT. Let T n .k be the information set generated by ei, . . . , e k - Under these 
assumptions, the return and conditional variance have the following representation under the historical measure P: 

\nS k ,h - mSfc-vj = + Xa k ,h - ^t,hj h + <?k,hVhe k , e fe ~ N(0, 1), (4-11) 
4+i,h-4,h = (ao + ^(/3 + a 1 (l + 7 2 )-l))/i + «i<, l ((e fc - 7 ) 2 -(l+ 7 2 ))^. (4.12) 



When n = 1, the above process resembles the standard NGARCH(1,1) specification of ( 4.2 )-( 4.3 1 . This model can 
be further extend to a bivariate continuous time process as follows: 

( S t,h,<rt,h) '■= ( S k,h,<rl + i,h) > t E [kh,(k + l)h) , k = 0, 1, . . . , nT - 1. 

Duan (1997) [] showed that when h approaches zero, (St,h,crthj converges weakly to a bivariate diffusion ( St, erf) 
which satisfies the following stochastic differential equations: 

din = + Xa t - ^a^j dt + a t dB lt , (4.13) 

dol = (q + ct 2 (/3 + a x (l + 7 2 ) - 1)) dt- 2a 1 ja^dB lt + ^2a 1 <j 2 t dB 2 t- (4.14) 

Here, B\ t and B 2 t are two independent standard Brownian motions under P. Applying a discretized version of the 



extended Girsanov transform to the bivariate process given in (4.11 )-(4.12), the risk neutral specifications become: 

lnS kih -lnS k - hh = (r-\ul^h + (j kjh Vhtt, 4~N(0,1), (4.15) 

2 2 

a k+l,h ~ a k.h 



{a + a% h {p + a x {\ + 7 2 ) -l))h + ((e£ - 7 - XVh) 2 - (1 + 7 2 )) vft.(4.16) 
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Taking the limit when h — > of \St,in&i h) un der the martingale measure Q given by the Extended Girsanov 
Principle (or conditional Esscher transform), we get: 

1 



d In St — 



af dt + a t dB* lt , 



daf = (a + cTt(l3 + ai(l+j 2 )~l + 2a 1 j\))dt-2aijatdBi t + V2ai<JtdB, 



21 ■ 



(4.17) 
(4.18) 



Here, B\ t and B% t are two independent standard Brownian motions under Q, such that dB^ t = dB u + Xdt and 



dB\ t = dB2t- An interesting aspect is provided by the fact that equations ( 4.17 )-( 4.18 ) can be obtained directly 



by applying the continuous time version of the minimal martingale measure to the bivariate diffusion from (4.13)- 



(4.14). A more detailed discussion about this is provided in Duan [32] . In the next proposition we compute the 
local risk minimization hedging strategy for the continuous time limit of the NGARCH process. 

Proposition 4.2 Suppose that the asset returns are governed by the stochastic volatility process determined by 
\4-13 )-(4-14)- Then the price Hf" and the local risk minimizing delta hedge Af of a European option with payoff 
H and maturity T , conditional on initial values {x, y) for the asset price and volatility process, respectively, are 
given by: 



e -r(T~t) E Q [H | F x 



Ar(x,y) = A b t {x,y)~2a ll y -Al{x,y) 



(4.19) 
(4.20) 



Here Tt' v = Tt U {(St, at) — (x,y)}, with Tt being the standard filtration in the stochastic volatility setup and 
Af(x,y) and A^(x,y) are the standard option delta and vega conditional on the initial values St — x and a t = y 
for the asset price and the conditional volatility process. The option prices are computed based on the minimal 



martingale measure Q under which the asset returns process follow (4-.ll)-{J h .18) 



The proof of Proposition 4.2 follows the same three steps procedure as in Poulsen et al. [63]; this algorithm follows 
the construction from El Karoui et al. [39j in which the market is first completed by introducing another tradable 
asset, then the delta hedge is computed in this setting and finally projected onto the original market. We notice two 
particular cases: first, ct\ = corresponds to a deterministic conditional variance and the local risk minimization 
hedging reduces to the standard Black Scholes delta hedge. Second, when 7 = the two Brownian motions are 
uncorrelated and in this situation the VM equals zero, so the option hedge corresponds to Af (x,y). 

Based on the discretization proposed above and its connection to the continuous time setting, we propose a 
different instantaneous delta hedge for the Gaussian GARCH setting. The next result serves as a justification for 
the use of that hedge instead of the one proposed by Duan [3T] and shows that the VMs of the GARCH models 
converge to their continuous time counterparts. 



Proposition 4.3 Suppose that (Sk,hi a k+x h) f°^ ows ^ e dynamics of (4-.ll)-(4--12) under the historical measure 
P and ( J t .lh^ -{^.16) under the risk neutral measure Q. Then the following relation holds: 



lim 

h-tO 



Cov p {S k+hh - S k ,h, o 2 k+2 , h - o- 2 k+1 . h I Tl%) 



Vax p [S k+lth 



S, 



k.i, 



• r k,hl 



lim 



Cov Q (S k+1 . h S k , h ,<T 2 +2 . h - o- 2 +hh I 7g) 
V & i Q [S k+1Jl -S kJl I Flf] 



-2ai7 — 
x 

(4.21) 
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Here T l% -- 
innovations. 



J~k,h U {(Sk,h, o~k = (x,y)}, with J-k.h being the standard filtration generated by the GARCH 



The proof is provided in the Appendix. Based on the continuous time analogy, we argue that the instantaneous 



local risk minimization delta hedge from (3.131 resembles Duan's delta hedge when either there are no ARCH 



effects or changes in asset prices are uncorrelated to changes in conditional variances and this translates to the 
absence of a leverage effect. When 7^0 one should include a correction term into Duan's delta hedging formula. 



Proposition 4.3 indicates that, if the time between two consecutive trades shrinks to zero and the GARCH greeks 
converge to the corresponding stochastic volatility greeks, then the minimum variance hedge approaches the local 



risk minimization stochastic volatility hedge. Moreover, from (3.13), the Gaussian GARCH local risk minimization 



hedge under Q converges to its continuous time counterpart. In view of these arguments, we propose the following 
alternative delta hedge under the NGARCH model: 



^mvsv 



, -2a l7 ?A" 



S t 



(4.22) 



When fitted to financial return data, the asymmetric GARCH model typically yields a negative leverage effect, 



7 > 0, so a non-negative value of the option vega leads to a smaller value of the adjusted hedge ratio from (4.221 
when compared to the one proposed by Duan [31] . In the numerical experiment from the next subsection, we 



implement this continuous time adjusted delta hedge, A™ 118 ", as well as the second approximation from (3.12) 
which in the NGARCH case leads to: 



Ar = Af + 2o 1 (e t - 7 )^A t ff 



(4.23) 



Although the results presented in this section are based on the NGARCH model, the same approach for the adjusted 
delta hedge can be implemented for other GARCH models by spelling out their continuous time limit. 



4.2 Numerical comparisons of local risk minimization schemes under Gaussian GARCH 
models 

In this section we provide numerical experiments to compare the hedging performance of our proposed schemes for 
the NGARCH(1,1) option pricing model from (4.2)-(4.3) with u>(e t ) = (e t — r y) 2 - In order to work with realistic 



parameters, we estimate our model based on historical daily closing prices of the S&P 500 index from January 2, 
1990 to August 30 2005, for a total of 3,951 observations. The NGARCH parameter values are consistent with 
what is usually found in the financial econometrics literature and are given below: 



a 



1.1 • 10" 6 , 



Oil 



-0.0556, 



p! = 0.8894, 



A = 0.0302, 



7 



-0.9098. 



We also assume the the annual interest rate is 0.7%. As in Ortega 62J, for each hedging strategy considered, we 
compute the mean square hedging error (MSHE) under the physical probability as follows: 

2" 



MSHE{£) = E 1 



k=l 



(4.24) 
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Since our study focuses on European Call options, the payoff at maturity of the option to hedge is H = (St — K) + - 
Vq represents the option price and it is computed based on the NGARCH model with Gaussian innovations using a 
Monte Carlo simulation, Vq — e~ rT E® [(St — K)+]- The four competing hedging strategies £w (the upper script 
(j) indicates that rebalancing takes place every j days) are summarized below: 



• £ = (£t)*e{j,2j t}i where £t is the Q local risk minimization (LRM) hedge computed using equation (3.10). 

• A s = (Af )te{o,j,...,T-j}, where Af is the instantaneous delta hedge proposed by Duan [31] and computed 
using equation (4.8). 

• A mv = (AJ nl! ) t g/oj j ... i T-j}; where A™" is the minimum variance instantaneous delta hedge computed using 



equation (4.23). 

• A mvsv = (Al nvsv ) t( z{oj _ ^T-j}, where A™ vsv is the stochastic volatility adjusted minimum variance instan- 



taneous delta hedge computed using equation (4.22). 

Most empirical studies on GARCH option pricing and hedging include the Black-Scholes model into their com- 
parison. There are two reasons we discard such a model from our simulation experiments: firstly, Ortega |62j 
found empirical evidence that both the local risk minimization and Duan's [31] outperform the Black-Scholes delta 
hedge, and secondly, since we prefer to investigate only the effect of the hedging scheme used in the behavior of 
the MSHE, it is better to put aside the Black-Scholes strategy because it uses an option price different to the 
GARCH one. 

Since there are no closed form expressions available, all quantities necessary to compute the above hedging 
ratios require Monte Carlo simulations. As previously mentioned, this can be carried out in two ways: either by 
simulating the price paths directly under Q, or by simulating the paths under P and averaging the quantities of 
interest adjusted by the corresponding Radon-Nikodym derivatives. We adopt here the second approach since this 
leads to a variance reduction in the Monte Carlo estimators (see for example Badescu and Kulperger [3] for a 
discussion about computing option prices by simulating under the historical measure P). 

We run three different exercises depending on the time to maturity T of our options and on the hedging 
frequency: 

• Exercise 1: T — 6, 8, 10 and 12 days and rebalancing is done in a daily basis. 

• Exercise 2: T = 10, 20, 30 and 40 days and rebalancing is done every 10 days. 

• Exercise 2: T = 20, 40, 60 and 80 days and rebalancing is done every 20 days. 

For all our simulations we consider the same starting value for the asset price, So = 100. In order to investigate the 
sensitivity of the hedging method to the option moneyness, we consider three strike prices, K = 90, 100 and 110. 
We use the unconditional variance for the NGARCH model as the starting variance for our numerical simulations. 
The option prices, Vq are computed based on 10,000 random price paths and each expectation from the hedging 
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Table 4.1: Gaussian NGARCH average Mean Square Hedging Errors (MS HE) (computed across all maturities) for each moneyness 
and exercise. Average values for the corresponding standard errors arc reported in the brackets. 





Moneyness 




A s 


^ TYIV 


^mvsv 




1.11 


0.4049 


0.0136 


0.0286 


0.0209 






(0.0371) 


(0.0029) 


(0.0062) 


(0.0045) 




1.00 


0.2342 


0.2159 


0.2039 


0.2074 


Exercise 1 




(0.0162) 


(0.0159) 


(0.0157) 


(0.0144) 




0.9 


0.0016 


0.0022 


0.0020 


0.0014 






(0.0006) 


(0.0009) 


(0.0008) 


(0.0007) 




1.11 


0.5429 


0.3853 


0.4904 


0.3645 






(0.0930) 


(0.1049) 


(0.1064) 


(0.0818) 




1.00 


1.4611 


1.5502 


1.4774 


1.4026 


Exercise 2 




(0.1253) 


(0.1685) 


(0.1382) 


(0.1211) 




0.9 


0.1172 


0.1602 


0.1405 


0.1139 






(0.0227) 


(0.0202) 


(0.0269) 


(0.0239) 




1.11 


1.8036 


1.2805 


1.9367 


1.4446 






(0.3991) 


(0.3574) 


(0.4909) 


(0.3770) 




1.00 


3.1631 


3.2265 


3.5204 


3.0039 


Exercise 3 




(0.3511) 


(0.4067) 


(0.4155) 


(0.3444) 




0.9 


0.9276 


1.2082 


1.2257 


0.9147 






(0.2223) 


(0.2042) 


(0.2509) 


(0.2184) 
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calculations are based on 5,000 Monte Carlo paths. The mean square hedging error associated with every hedging 
strategy will be computed by taking the mean of the terminal square hedging errors over 1000 price paths. 

We start our analysis by examining the aggregate MSHE for each exercise. The very short term options with 
a daily rebalancing indicate that all instantaneous delta hedges (A s , A mv and A mvsv ) produce almost the same 
MSHE value of around 0.077, which is much lower than the corresponding MSHE for the local risk minimization 
hedge of 0.215. For the second exercise, the overall performance of our methods is much closer, the stochastic 
volatility adjusted minimum variance hedge having a slightly lower error of around 0.63, all other methods being at 
0.7. The longer term option combined with the lower frequency rebalancing suggest that A mvsv (MSHE = 1.78) 
is the overall best strategy, while A mv (M SHE = 2.22) being the worst. We can also conclude that the minimum 
variance approximation does not work very well in terms of this hedging criterion measure. 

Additional interesting conclusions can be drawn by analyzing the hedging errors for each moneyness considered. 
The average values for the MSHE (computed across maturities) and the corresponding standard error averages 
are presented in Table |4~T) All three exercises document that the behavior of our hedging scheme proposed depends 
crucially on the option's moneyness. For example, for the in-the-money (ITM) options in Exercises 1 and 3, we 
note that Duan's delta hedge provides the lowest MSHE, while its performance for Exercise 2 comes into a very 
close second place (A mvsv is the best in this case). The out of-the-money (OTM) options consistently indicate that 
A mvsv is the best method for all exercises, with the local risk minimizing strategy being also very close in value, £ 
(the largest MSHE difference between these two methods is 0.013. Both the other methods perform much worse 
in this case. For at-the-money (ATM) options there are no significant differences between the hedging strategies. 
However, we still note that A mvsv represent the best scheme for Exercises 2 and 3, followed by £ again. Moreover, 
the standard errors corresponding to these two methods are the smallest overall. 

The smaller MSHE values for the deeper ITM and OTM European Call options confirm that these contracts are 
easier to be hedged compared to the ATM counterparts. Such findings have been also documented in the literature 
in various forms. For example, the recent study by Denkl et al. [30] which investigates the hedging performance of 
delta hedge ratios for exponential Levy processes, indicates a concave behaviour of the MSHE with respect to the 
spot price and for a fixed strike price. The MSHE increases as the option is near the money. The behaviour of the 
mean square is further illustrated in Figures |4.1| These graphs confirm our previous findings that A mvsv and £ 
outperform the other methods for longer term, ATM and OTM options, while the local risk minimization provide 
a poor alternative for very short maturity options hedged with daily frequency. The effect of the hedging horizon 
can be depicted by comparing the results for the same maturity and different exercise. For example, when T = 40, 
the hedging every 10 days instead of 20 days improves the mean square hedging error by 37% for A s , 43% for £, 
44% for A™" and by 46% for A mvsv . Thus, the proposed stochastic volatility adjusted delta hedge benefits the 
most when the hedging frequency is augmented. 
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(a) Exercise 1. The four maturities considered are 6, 8, 10, and 12 days and hedging is carried out daily. 




(b) Exercise 2. The four maturities considered are 10, 20, 30, and 40 days and hedging is carried out every 10 days. 




(c) Exercise 3. The four maturities considered are 20, 40, 60, 80 days and hedging is carried out every 20 days. 
Figure 4.1: Mean square hedging error (MSHE) for the Gaussian NGARCH model and for the following hedging schemes, local risk 
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5 Numerical comparisons of Q-local risk minimization schemes under 
Normal Inverse Gaussian GARCH models 

In this section we provide an empirical study of local risk minimization (LRM) hedging strategies constructed 
using a martingale measure for a special case of non-Gaussian GARCH models with Normal Inverse Gaussian 
(NIG) innovations. The NIG distribution has been studied in the context of the modeling of financial returns 
by Barndorff-Nielsen [8], and has been used in the ARCH setting by Jensen and Lunde [57]. More recently, the 
performance of GARCH option pricing models based on NIG innovations was investigated by Badescu et cd. [3] for 
European style options and by Stentoft |68j for American options, among others. Empirical findings indicate that 
the NIG density is more appropriate for capturing the negative skewness and thicker tails generally displayed by 
asset returns than its Gaussian counterpart; moreover, option prices based on a NIG-GARCH model outperform the 
Gaussian benchmark. In the following paragraphs, we briefly review several notions related to the NIG distribution 
for the driving noise process and discuss its implication for option pricing based on the conditional Esscher transform 
and the Extended Girsanov Principle used in our numerical simulations. 

We suppose that the innovations e t follow a conditional NIG distribution with parameters k, a, s and £, that is, 
£t | Tt ~ NIG(fc, a, s, £), and with probability density function given by: 



fe t ( x ) = — exp 



Vk 2 



a 



x-l 



*x(*Vl+(^) a ) 



(5.1) 



V 1 + (^) 2 

Here Ki() represents the modified Bessel function of third kind and index 1. The parameter k measures the kurtosis 
of the distribution, a the asymmetry, s is the scale parameter, and i is the location of the distribution. Note that 
this NIG parametrization is particularly appropriate for practical implementations since it makes et belong to 
a location-scale family of distributions; for this reason, k and a are called invariant parameters. The cumulant 
generating function is given by: 

k£(z) = zt + (Vfc 2 -a 2 - yjk 2 - (a + zs) 2 ) . (5.2) 
Thus, the first two conditional historical cumulants are: 



E p [e t \T t -x] = 



Var p [e t | F t -i] 



d<(*) 



dz 



= e + 



z=0 \/k 



2 



d 2 <{z) 



21.2 



S 2 k 



z=o (Vfc 2 - a 2 ) 3 ' 



dz 2 

Since the GARCH innovations have mean zero and unit variance this leads to a reduction in the NIG parameters 
which need to be estimated. The scale and the location can therefore be expressed in terms of the invariant 
parameters: 



From the scale-invariance property and (2.2 1 it follows that the asset returns are conditionally NIG distributed 
with the following parameters: 

Vt I -Ft~NIG(fc, a, sa u m + la t ). (5.3) 
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We now have all the tools needed to derive risk-neutralized dynamics based on our NIG-GARCH model. Unlike 
the Gaussian case, the two measures proposed in Section [2] lead to different equations for the return process under 
the pricing measure. Although such dynamics are available and can be easily obtained by inserting the above 
quantities into Propositions |2.3| and |2.5[ we undertake a similar approach as in the previous section by computing 
hedging errors based on the computation of cumulants under the historical measure P. This requires only the 
knowledge the P-returns dynamic and the two Radon-Nikodym derivatives corresponding to the pricing kernels. 



To fully specify the model, we assume the same structure for conditional mean return, u. t as in (4.1) and the same 



asymmetric conditional variance specification. Thus, the historical dynamics for the returns is given by: 



yt 



= r + Acr t - nf t (<7 t ) + a t e t , e t ~ NIG(fc, a, s,£), 



(5.4) 
(5.5) 



Here the cumulant generating function is given in (5.2) 



For the Extended Girsanov Principle, the market prices of risk is constant and the Radon-Nikodym derivative is 
given by: 



dP 



exp 



aXT 

s 



t~t / s 2 + (£ t -*) 2 Mfcy 1 

hi V s 2 + (e* + A- if 



' e t +X- 



For the conditional Esscher transform, the Radon-Nikodym derivative is given by: 

T 



(5.6) 



dQ e 



dp 



exp 



exp 



t a t (e t -£) + Jk 2 -(a + 9 t a tS y 



(5.7) 



For any t £ [0, • • • ,T], the Esscher parameter 6 t is the unique solution of the martingale equation (2.11). As 
previously mentioned, one of the advantages of using the NIG distribution in practice is that a closed form expression 
for 9 t is available, namely: 



l__j L _l_ / «(a t )-(A + l)q t ) 2 
2 cr f s 2 V ofs 2 



4fc 2 



1 



(5- 



W 2 S 2 + «(a t )-(A + ^7 4 ) 2 
When an exact solution for 9 t does not exist (for example the case of hyperbolic innovations) one can numerically 



solve (2.11) or use the second order Taylor approximation (2.12) 



The numerical experiments that we present in this section are based on NIG-GARCH parameters obtained via 



maximum likelihood estimation (MLE) on the same data set of asset returns used in Section 4.2 
a Q = 8.8 ■ 1CT 7 , ai = -0.0537, ft = 0.8946, A = 0.0318, 



7 



0.9157. 



The estimated values for the NIG invariant parameters are, k = 3.0897 and a = —0.3353. As in the Gaussian case, 
we run the same three exercises that study different option maturities and rebalancing frequencies. Since one of 
the main objectives of this section is to examine the effect of the pricing measure in the hedging performance, we 
consider only local risk minimization (LRM), £, and Duan's (1995), A s , as our hedging candidates. Both quantities 
are computed by simulating price paths under P and by making use of the Extended Girsanov Principle and the 
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conditional Esscher transform via the corresponding Radon-Nikodym derivatives (5.6 1 and (5.7), respectively. The 



minimum variance delta hedges considered in the Gaussian NGARCH case are left out of this exercise because 
our continuous time limit diffusion approximation does not work in the case of the Esscher transform since the 
asset returns do not exhibit a standard NGARCH dynamic after the change of measure. The MSHE is evaluated 



based on equation (4.24), the only difference being that the option price Vq is now computed separately for the 



each pricing measure. We use the same starting values for the asset price and the conditional starting variance, 
as well as the same number of Monte-Carlo simulations necessary for computing the prices and hedges as in the 
Gaussian case. The values for the MSHE (computed across maturities) and the corresponding standard errors 
are presented in Table [572[ while the dependencies of the MSHEs with respect to each maturity are plotted in 



Figure 5.1 and Figure 5.2 



Table 5.2: NIG-NGARCH Mean Square Hedging Errors (MSHE) (computed across all maturities) for each moneyness and exercise. 
Values for the corresponding standard errors are reported between brackets. 





Moneyness 




A s 




A s 




1.11 


0.4464 


0.0196 


0.4466 


0.0197 






(0.0393) 


(0.0055) 


(0.0391) 


(0.0055) 




1.00 


0.3260 


0.3088 


0.3273 


0.3102 


Exercise 1 




(0.0266) 


(0.0267) 


(0.0266) 


(0.0267) 




0.9 


0.0064 


0.0088 


0.0063 


0.0088 






(0.0033) 


(0.0047) 


(0.0033) 


(0.0047) 




1.11 


0.6365 


0.4031 


0.6445 


0.4023 






(0.0861) 


(0.1095) 


(0.0859) 


(0.1089) 




1.00 


1.4988 


1.5911 


1.5025 


1.5949 


Exercise 2 




(0.1313) 


(0.1645) 


(0.1308) 


(0.1639) 




0.9 


0.1231 


0.1994 


0.1225 


0.2017 






(0.0262) 


(0.0421) 


(0.0262) 


(0.0424) 




1.11 


1.8483 


1.6573 


1.8589 


1.6538 






(0.2756) 


(0.3821) 


(0.2716) 


(0.3796) 




1.00 


3.5731 


3.8464 


3.5864 


3.8684 


Exercise 3 




(0.3022) 


(0.4289) 


(0.3006) 


(0.4298) 




0.9 


0.9399 


1.1849 


0.9413 


1.1983 






(0.1289) 


(0.1164) 


(0.1293) 


(0.1171) 



We first notice that there are no significant differences between the two pricing measures in terms of the hedging 
performance and their standard errors. Thus, we next look at the differences between the hedging prescriptions. 
Except for Exercise 1, for which LRM performs rather poorly, the overall average MSHE of £ eff p/£ eS s and Af gp / Af ss 
are almost equal. However, as in the Gaussian, case each method behaves differently depending on options' 
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moneyness. For example, Duan's A outperforms the local risk minimization for all ITM options while the latter 
is the best hedging scheme for almost all other cases except ATM short term options (as in Exercise 1). A closer 



look at Table 4.1 and Table 5.2 for Exercises 2 and 3 reveals that the overall MSHEs for the corresponding LRM 
strategies are not far from each other (with generally smaller standard errors for the NIG case). However, such a 



comparison may be misleading; indeed, the options prices that appear in (4.24) arc different and moreover, they are 
computed together with the other necessary quantities via Monte Carlo simulations using different starting values 
of the conditional variance process. Therefore, in order to compare the hedging performance between GARCH 
models with different innovations, one has to take into account all the potential sources of errors which may end 
up playing an important role in the measure of the hedging performance. 



6 Conclusions 

In this paper we propose and analyze a variety of option hedging schemes for GARCH models driven by Gaussian 
and non-Gaussian innovations. Since the well-known minimal martingale measure is typically a signed measure for 
such models, the construction of local risk minimization (LRM) schemes with respect to the historical measure is 
not possible. Therefore, we first introduce the notion of LRM with respect to a martingale measure for general 
GARCH setups. The changes of measure used in our numerical experiments are the Extended Girsanov Principle 
and the conditional Esscher transform, for which we describe the risk neutral dynamics. 

Next we analyze local risk minimization in the context of Gaussian GARCH models. We show that when time 
between two consecutive hedging trades is very small, the local risk minimization scheme can be approximated 
in two different ways. Both approximations can be viewed as generalizations of Duan's delta hedge formula |31j 
by taking into account the dependence of tomorrow's conditional variance on today's asset price. The first limit 
approximation is inspired in some of the existing results on the risk minimization for continuous time stochastic 
volatility models. More specifically, for a special asymmetric GARCH model, we propose a novel minimum variance 
delta hedge based on the continuous time bivariate diffusion limit of such a model. The second limit approximation 
is a minimum variance hedge that is obtained in a straightforward manner using chain rule differentiation. An 
extensive numerical experiment is constructed in order to compare these hedging schemes. Our simulation results 
suggest that the stochastic volatility inspired minimum variance hedge (A ml,s ") is the best overall hedging method 
in the sense that it provides the smallest mean square hedging error. The local risk minimization (£) comes into 
a close second place, although this method performs poorly for very short maturity options with high frequency 
rebalancing dates. After a more detailed data analysis, we found that both A mvsv and £ outperforms the other 
two methods for longer maturity options and lower hedging frequencies; interestingly, Duan's delta hedge is the 
best method for hedging out of-the-money options. Our results are consistent with some of the findings from the 
continuous time hedging literature according to which, local risk minimization provides the best hedge when the 
two driving Brownian motions of the corresponding stochastic volatility model are correlated. 

Finally, we test the sensitivity of the local risk minimization and the Duan hedges with respect to the martingale 
measure used. Numerical experiments indicate there are no significant differences between the Extended Girsanov 
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(a) Exercise 1. The four maturities considered are 6, 8, 10, and 12 days and hedging is carried out daily. 




(b) Exercise 2. The four maturities considered are 10, 20, 30, and 40 days and hedging is carried out every 10 days. 




(c) Exercise 3. The four maturities considered are 20, 40, 60, 80 days and hedging is carried out every 20 days. 
Figure 5.1: Mean square hedging error (MSHE) for the NIG-NGARCH model under the Extended Girsanov Principle for the schemes 
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(c) Exercise 3. The four maturities considered are 20, 40, 60, 80 days and hedging is carried out every 20 days. 
Figure 5.2: Mean square hedging error (MSHE) for the NIG-NGARCH model under the conditional Esscher transform for the schemes 
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Principle and the conditional Esscher transform. As in the Gaussian case, we found that the LRM outperforms 
Duan's delta hedge for long term options and lower hedging frequencies, while the latter is better for dealing 
with in-the-money options. It will be interesting to test and compare our hedging schemes for different types of 
Gaussian and non-Gaussian GARCH models on real option data sets, including very long term contracts. Since our 
stochastic volatility adjusted scheme outperformed all other candidates, other similar extensions can be proposed 
for a variety of non-Gaussian GARCH models. 
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7 Appendix 



7.1 Proof of Proposition 2.3 



(i) Using that g t is a J^-predictable process, we proceed by computing the one-step conditional mean of N t : 

& + Qt) 



E p [N t \T t -i] = J 



f p (x)dx = / f? t (y)dy = 1. 



Thus, the martingale property of Z t follows imediatelly: 

E p [Z t \T t -i\ = Z t - X E P [N t \F t -i] = Zt-v 

Since Zt '■= dQ/dP > is by construction non-negative, the fact that Q is an equivalent probability measure with 
respect to P follows from: 

eQ[i] = e p [z t ] = e p [z ] = i. 

(ii) Since t* t :— e t + gt, we notice that Tt = c(ei, ■ • ■ , e t) — &{,t\> ■ ■ ■ , e* )• We first show that e t under P and e£ 
under Q are conditionally identically distributed by computing the corresponding cumulant generating functions 
assuming they exist. Let u > and for any z € (—u,u) and using the Bayes rule, we have: 

'"" " -">-,, w/,-= / , ,, , 



t W =E Q 



= E* 



In particular, by taking the first and second order derivatives of the cumulant generating functions at zero, we 
notice that [e*|Ji-i] = and Var Q [e* |J" t -i] = 1. 

In order to show that e£ are conditionally uncorrelated under Q we let I > s > 1. We have: 

Eft [e* t e* t _ s \F t -i] = E® [Eft [e*e*_ s l^t-i] \?t-l] = Eft [e* t _ s E Q [e* t \F t -x] \F t -l] = 0. 



(iii) From equations (2.2) and (2.10) and using the previous result we have: 

fit + K P (<J t ) - r 



Vt=fH + CT t e t = fi t + o t e t 



Q 



r - K?*{<Tt) + <r t e t 



The risk neutral property follows by verifying that the discounted asset prices satisfy the martingale property under 
Q. Indeed 



EQ [e»*LFt-i] = -E° 



i — k i (o- t )+CT t e 

e t 



Replacing e t by — gt into the conditional variance equation (2.3) concludes the proof. 



7.2 Proof of Proposition 2.5 



(i) The proof follows the same steps as in the previous proposition after noticing that: 

E p [JV t |Ji_i] = E p [e etCTtet -< (etCTt) |J- t _il = 1. 
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(ii) Since e* t 



the sigma algebras generated by the two innovation processes coincide. Next, we 



V<"(0*<rt) 

compute the conditional cumulant generating function of t t under Q. Let u > and for any z G (— u, u) we have: 



k«(z) = ln£ Q [e^lJi-x] = ln£ p [e ze 'iV t | = ln£ p 
= n? t (z + 9 t <7 t ) - <(6» t cr t ). 



,zet e e t (7tet-K H {B t o t ) | jr t _ 1 



(7.1) 



Based on the above relation, we compute the first two conditional moments of e t by taking the first and second 
order derivatives of the cumulant generating function at the origin. 

d&(z) 



E^[e t \T t -i] = 
Vzr Q [e t \T t -i] = 



dz 



<'(0t<Tt) 



dz 2 



Pll 



z=0 



Thus, using the definition of e*, we obtain immediately that E® [e^ \Ft-i\ — and Var*^ [e*|J-t_i] = 1. The fact 
that el are conditionally uncorrelated follows in a similar way as in the previous proof. 



(iii) From equation (2.2) it follows that: 



Vt = IM + ate* = Mt + <J t Kf t ' (0 t <Jt) + cr t J K^'(6 t a t )e* t . 



If we denote <r t * = a±\ k^' '{0t<Jt), we only need to show that: 



fit + <T t Kf'(0t<T t ) = r- «£.(<7*), 



0, 



(7.2) 



(7.3) 



in order to obtained the required risk neutralized return dynamic under the conditional Esscher transform. We 
start by evaluation the cumulant generating function of t\ under Q. 

-~f/(flt<* t r 



f,(z) = \nE Q 



• v /"f t "(«t"t) 



Using the above relation, (2.111, and (7.1) we have 



= fj, t + a t Kf t '(6 t (Xt), 



which proves (7.3). The risk neutral variance equation under the Esscher transform is obtained by multiplying the 



physical conditional variance equation (2.3) by \/ Kf t " {Ot&t) ■ 



7.3 Proof of Proposition 3.2 



(i) The value X t at a given time t of the hedge portfolio constructed in the statement is given by: 

X = U t - A? v S t . 

If the hedge is kept constant over the next period, the change in value of the portfolio between two consecutive 
time steps is: 

x t+1 -x t = n t+1 - n t - a™ (St+i - s t ) . (7.4) 
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When the time between the observations is small, we can use the following approximation for the change in the 
option price: 

It follows that the one-period variance to be minimized is approximately given by: 

'dIL t 



d n * / 2 _ , j , 

9 \"t+2 "t+lj 



Var^ (X t+l - X t | T t , 



dS t 



A™" Var^ [S t+ i - S t \ F t ] 



1 |'g - Ar) ^-Cov p (S t+1 - S t ,*l 2 - o 2 t+1 | F t ) 



dc 



t+i 



Var p [a 2 t+2 - a 2 t+1 \ F t ] . 



Taking the derivative of the above expression with respect to A™ v and setting it equal to zero, we have: 

. Cov p (5 t+ i - St, <r 2 +2 - of+i I Ft) 



Af v « A? + A° 



Va,r p [S t+1 -S t | Ft] 



Moreover, if we further approximate the next period change in the conditional variance as 

d(J t+i 



fT t+2 °t+l 



dS t 



{St+i — s t ) , 



and plug it into (7.6) we obtain the second approximation for the minimum variance delta hedge. 



(7.5) 



(7.6) 



(7.7) 



(ii) When the local risk minimization is done with respect the risk neutral measure Q, we have Vt = lit. Thus, 



plugging in the approximations from (7.5) and (7.7) into the formula for the local risk minimization hedge under 
Q, we have: 



t+i 



Cov« (v t+1 ,S t+1 - S t | Ft) _ GoY Q {Vt+1: St+1 | Ft) 



Var Q 



St+i — St | Ft 



Var^ [St+i | F t ] 



a<r 2 t+1 v u t+2 u t+ 



of+i) j S t +i | F t 



Var« [5 t+1 | F t ] 
Cov Q (S t+1 - S t , cr t 2 +2 - a 2 t+1 | J" t ) 
Var Q [S t+1 -S t | Ji] 

Cov Q (V, + (If + (St + i ~ S t ),S t+1 | * 



= A^ + A? 



Var 03 [S t+ i | Ft] 



7.4 Proof of Proposition 4.1 



We compute separately each term in (3.12) in our setup. Firstly, for any fixed i, we assume that certain regularity 
conditions which allow the interchange between the derivative and the expectation are satisfied. Using (4.6), we 
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can write 



Af 



— (S u „ 2 +1 (S t )) = e-rV-D—E? [max{(5 T - K),0}] 



-r(T-t) 



0_ 

dS t 



dS T 



L {St>K} 



-E?[{S T -K)l {ST>K} ]=e-^)E? 
Second, we compute the option vega Af = ^ (S t , <r 2 +1 (S t )) ■ We start by using (4.7| in order to rewrite (4.6| 



St 

s7 



1{S T >K} 



(7.8) 



as 



St — St 



,r(T-t)-lff? +1 +fft+ie£ + i-J[££=l* a A(t+Lt+l)+a^ +1 B(t+l,t+l)}+Y,f =2 t [a A(t+Lt+l)+a^ +1 B(t+l,t+l)] 2 e * 4 



Using this expression it is very easy to compute the derivative: 

0+xi(^-i)fl(t+*,t+i) 



dS 



t St 
2~ 



Z=2 



St 
~2~ 



T-t 

E 

./=i 



4+j 

O't+l 



1 S(t + Z,i + 1) 



(7.9) 



where in the last equality we use the convention B(t+l,£ + l):=l. We now use (7.9) to compute 

dS T , 



Af = /^(5 t ,a^(^)=e-^-*)^^[ma X {(S r -X) ) 0}] = e-^- t )^ 
oa t+i 0(J t+i 



r(T-t) 



«,* + !) 1{S T >K} 



(7.10) 



7.5 Proof of Proposition 4.3 



We only show the first equality for the second one follows in a similar fashion. Let a\ , 1 h (x, y) and e k (x, y) be the 
restrictions of crf, +1 h and tk to the filtration J 7 ^'^- From (4.12) it follows that 

a k+i,h( x r v) =y 2 + a i(y) h + a 2 (y, e k (x, y)) Vh. 
Here the functions a± and a 2 are given by: 

ai{y) = a + y 2 (/? + ai(l+7 2 )-l), 
a.2(y,£k(x,y)) = a x y 2 ((e k {x,y) + jf - (1 + 7 2 )) ■ 

It follows immediately that lim/ l _ i .o Ck+i,h(%i y) — y- First, we compute the denominator of the left hand side 



of (4.21 ) for the Gaussian NGARCH model using 2.5 We have: 



Var' 



Sk+l,h — S k ,h I J^k'h 



= x 2 e 2 ( r+ACTfc+1 ' h ^'^ _ 5 cr fc+i,fc( :E >«))' l Var p e^+x^Vhek+x | jr^v 

— x 1 2 ( ?(r+\<j k + lth (x,y)-\cj 2 k + lh (x,y))h I e K k {2cr h+ i ih (x,y)Vh) _ ^K h ((T k+lth (x,y)s/h) 



= X 2 e 2 ( r + Xa k+i,h(x,y))h f e al +1:h (x,y)h _ _ 

The conditional covariance between changes in asset price and conditional variances is given by: 

C0V P [Sk+l,h - S k ,h, &k+2,h - a l+l,h I ^kJi) = CovP [Sk+l,h, &k+2,h I ^kl 



xa 1 hVhal +1 , h (x,y)e( r+Xak + 1 - h{x ' y) -^ 2 ^^ {x ^y 
Cov p (e^+^+i, (e k+1 + 7 ) 2 I K'j) ■ 
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Using the fact that (e k +i + 1) 2 has a noncentral chi-squared distribution with one degree of freedom and noncen- 
trality parameter 7 2 , it follows that: 

Cov p (e fc+1 + 7 ) 2 | = e5^ +1 (-^V fc+1 ^( 2 ;,y)^( ( 7 fc+1 , ft ( a ;,y)^ + 2 7 ) , 

and therefore, the numerator becomes: 

Cov p (S fc+lifc - S k>h ,oi +2th - a 2 k+hh | F x k %) = xaM r+Xak+lMx > y))h 4+iJ^y) (<r k+1 , h (x, y)Vh + 2 7 ) . 

Taking the limit as h approaches zero we obtain: 

Cov p (S fc+1 , fe - S k , h ,oi +2ih a 2 k+1M | Til) ,. aihal +hh (x,y) (* k+1 , h (x, y)Vh + 2 7 ) 

hm 5 ! ! ! — = hm -, r — = 2ai 7 — . H 

h^o V a r p [S k+1 , h -S k , h | Tl%\ h^o x (^l +1 , h (x,v)h _ ^ ^ 



